FIG. 1
Purification Result for Endo-β-N-acetylglucosaminidase (15-25% gradient SDS-PAGE)

Lane 1: Purified endo-β-N-acetylglucosaminidase from Mucor hiemalis
Lane 2: Molecular weight markers
FIG. 2

Novel endo-β-N-acetylglucosaminidase gene

Not I  Pvu II  Hind III  Bcl I  Sac I  EcoR I  EcoT22 I  Sal I

Restriction enzyme map for pZL-Endo including the full-length
novel endo-β-N-acetylglucosaminidase gene.
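
A restriction map of this kind can be checked against a sequence by searching for each enzyme's recognition site. The following is a minimal sketch only: the recognition sequences are the standard published sites for the enzymes named on the map, while the helper name `find_sites` and the toy sequence are hypothetical and are not taken from pZL-Endo. All eight sites are palindromic, so scanning one strand is sufficient.

```python
# Standard recognition sequences for the enzymes named on the map.
SITES = {
    "NotI": "GCGGCCGC",
    "PvuII": "CAGCTG",
    "HindIII": "AAGCTT",
    "BclI": "TGATCA",
    "SacI": "GAGCTC",
    "EcoRI": "GAATTC",
    "EcoT22I": "ATGCAT",
    "SalI": "GTCGAC",
}


def find_sites(seq, sites=SITES):
    """Return {enzyme: [1-based start positions]} for every site found in seq."""
    seq = seq.upper()
    hits = {}
    for name, motif in sites.items():
        positions = []
        start = seq.find(motif)
        while start != -1:
            positions.append(start + 1)  # report 1-based coordinates
            start = seq.find(motif, start + 1)
        if positions:
            hits[name] = positions
    return hits


# Hypothetical toy insert flanked by Sal I and Not I sites (not the real sequence).
toy = "GTCGACAAAGAATTCTTTGAGCTCAAAGCGGCCGC"
print(find_sites(toy))
```

Enzymes with no site in the input are simply omitted from the result, which keeps the output readable for long sequences.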
FIG. 3

10 20 30 40 50 60 

GTCGAOCCAC OCGTTXOCOG ACGCGTOGGC GGACCCGTGG GCGGACGCGT GGGTTTTATr 

70 80 90 100 110 120 

TTACATAAAT ATOCCTTCAC TTCAATTGCA ACCTGATGAC AAACTAGCAC CIVITIVI ' IT 

130 140 150 160 170 180

TOCACTTAAG TCTATGAATG AGTTGAGGGA CTGGACGCCA GACGAAAAGA TAAAGTTTAA 

190 200 210 220 230 240

CGTTTCAAGC GTQGCACTAC AGCCTCGTGT GAAAAACGCC CTGAAACCTC AATTATTGTT 

250 260 270 280 290 300 

AACICA0GAT ATQGCAGGAG GATATAAAGA AGATAAAAAT ATTCAAGGAA ACAATTATAA 

310 320 330 340 350 360 

AGACATTTAT AACATTCAAT ATTGGCATTT AGCTGATACT TTTGTATATT TCTCTCATGA 

370 380 390 400 410 420

GCGAGTTAGC ATTCCTCCAG TCAATTGGAC AAATGCTTGT CATAGAAATG GTGTAAAGTC 

430 440 450 460 470 480 

TTTAGGTACT TTITTAGTAG AAGGAAATAA CCAAATGCAT GAAATGGAAG CCTCGCTTCA 

490 500 510 520 530 540 

COGTCCACCT TTACTTAATA ACACTGACGA CCCTATGAGA TTATGGAGTC CGTATTATGC 

550 560 570 580 590 600

AGACCAAXffi GTTGCTATTG CTAAACACTA TGGTTTTGAT GGCTGGTTGT TCAATATTGA 
610 620 630 640 650 660

ATOCGAAXTC TTTCCTTITC CTACAAATCC AAAATTCAAA GCTGAAGAGT TGGCAAAGTT

670 680 690 700 710 720

TCTACACTAT TTCAAGGAAA AATTOCATAA CGAAATACCT GGATCTCAAC TCATTTGGTA

730 740 750 760 770 780

CGACAGCATG ACAAATGAAG GAGAAATCCA CTGGCAGAAC CAGCTCACAT GGAAAAATGA

790 800 810 820 830 840

GTTATnTIT AAAAACACGG ATOGTATTTT TTTGAATTAT TGGTGGAAAA AAGAATACCC

850 860 870 880 890 900

TGAAATQQCG CGTAGACTAG CTGAAGGAAT AGGTAGATCA GGTTTAGAAG TTTATITrGG 

910 920 930 940 950 960 

TACAGATGTA TOQGGAAGQC ATACTTATGG TGGCGGTGGT TTCAAATCAT ATAAGGGTGT 

970 980 990 1000 1010 1020

AAAAACTOCC TACTCTGCAA TGACATCTTC TGCATTATTT GGTATGGCAT GGACATACGA 

1030 1040 1050 1060 1070 1080 

GCATTTCGAA AAGTCTGAAT TTGAAAAGAT GGATCGTTTG TTTTGGTGTC GT GGTA AATA 

1090 1100 1110 1120 1130 1140 

CTCTGACTAT CCfCCCCCAC CTCCTAAAAA CCCAGATGAC GAAAAAGAAG TAGAAAGCGA 

1150 1160 1170 1180 1190 1200 

TGATAGTGAA GATGAGCTCA TGTACGGACA CAAGAAAGGT ATTGCTGACA CGGTAGAATC 

1210 1220 1230 1240 1250 1260

TATTCCTCT3V CCAGGAACAG ATTGGTTTGT TACCAATTTT GATAGGGGGT TTGGAAATAG 

1270 1280 1290 1300 1310 1320

GTTTEATTAT AGAGGAAAGA GATTACTTTC TCAGCCTTGG TCCCATTTAT CGCATCAAGC 

1330 1340 1350 1360 1370 1380 

TATTCTCCCC AATAAAAOCT ATCGAAATCC AGAGATTTAT CCCACTGATC AAAACATTAA 



Entire nucleotide sequence of the fragment inserted into the Sal I-Not I sites of pZL-Endo
including the full-length novel endo-β-N-acetylglucosaminidase gene.
FIG. 4

1390 1400 1410 1420 1430 1440

AATCACTAGT TCTdCGATT GOGATCATGG AGCTTITCTr GGTGGAACCT CGCTTATTAT 

1450 1460 1470 1480 1490 1500

CAAAQGCCAA CGTTTCAATC ATAGAGAATC GCATCATCTT GAAACTGAAA TTAGTATACC 

1510 1520 1530 1540 1550 1560

TCTGTATAAG CTCTCATTAG ATGCTAGTAA AGGATCCTCA TTGCGTTATA TTTATAGAAC 

1570 1580 1590 1600 1610 1620

TTTCTTGATG AAAGATGTAA AGTTGACAGT AGCATGTCAC TTTTCGTTAA AAACAAACGA 

1630 1640 1650 1660 1670 1680

CTCAGTEAAT TTCTTCAAGG TATGGCAGCC AGATGAAAAT TO11T1 TO AATATGATGA 

1690 1700 1710 1720 1730 1740 

TGGAATGAGA GCCACTGTTA CAACTGAAAA TTCTACCGAA AGCAGATCCT TITTA1TACG 

1750 1760 1770 1780 1790 1800 

TACAACAGAA GAAGATACAG GAGAAAATGA 1TGGATAACA AAAACTATTA ATGTGCCTCC 

1810 1820 1830 1840 1850 1860

TGTTOCAGAA GGAAGTCAAT TATACATTAC AAGACTTGAA GTGAGCGTAG TATTAGATAC 

1870 1880 1890 1900 1910 1920

AGCTGGTTTA GTAGGTCTTC TTAATCAAGT tattgcttcc TTGGGATATA ttagcatcat 

1930 1940 1950 1960 1970 1980

ACCAACTATA AATTCTGGAA TAAAAACAGA TTCATCACGC ATTATTCAGG ATCTCTTTTG 

1990 2000 2010 2020 2030 2040 

GAAAGATCAG AAATATACCA AAATCGGAAA AGAAAGTTTA GACGACATAG CTCAAGAAGA 

2050 2060 2070 2080 2090 2100

AGTTCATAGA TATTATGGAA CATTGAACTT3 GGAAAACACA GCAAATGTAG TAAACGCTTT3 

2110 2120 2130 2140 2150 2160 

GGAGGAAATA. GATTACTACA ACGTTmTA CAAAGAAAGT GACGACTCTG CAACTCGCAT 

2170 2180 2190 2200 2210 2220

CTITITAGGA ACAGCATTCT GTAATCAATT TCGTGTATCT GGTTTAGATA 1TATTTTATC 

2230 2240 2250 2260 2270 2280 

TAAGCTACCA AAGATAGTTA TTCAAGCTGT TAACAAAGAA GGATACATCT CITCAAGTGG 

2290 2300 2310 2320 2330 2340

TAGCATAGAT TTGTCATTAA ACTAGGACTT GAAATAAAAT ATTATGATAA AGAAAAAAAA 

2350 2360 2370 2380 2390 2400

AAAAAAAAAA AAAAAAAAAG GGCGGCCGC 



Entire nucleotide sequence of the fragment inserted into the Sal I-Not I sites of pZL-Endo
including the full-length novel endo-β-N-acetylglucosaminidase gene. (Continued)
FIG. 5 09/700993

9 18 27 36 45 54 

5' ATC CCT TCA CTT CAA TTG CAA CCT GAT GAC AAA CTA GCA CCT GTT TCT ITT GCA 

MPSLQLQPDDKLAPVSPA 

63 72 81 90 99 108 

CTT AAG TCT ATC AAT GAG TTG AGG GAC TOG ACG OCA GAC GAA AAG ATA AAG TIT 

LKSMNELR DWT PDEKIKF 

117 126 135 144 153 162 

AAC GTT TCA AGC GTG GCA CTA CAG CCT CGT GTC AAA AAC GCC CTG AAA CCT CAA 

NVSSVALQPRVKNAL KPQ 

171 180 189 198 207 216 

TEA TIG TTA ACT CAT GAT ATC GCA GGA GGA TAT AAA GAA GAT AAA AAT ATT CAA 

LLLTHD MAGGYKEDKNIQ 

225 234 243 252 261 270 

GGA AAC AAT TAT AAA GAC ATT TAT AAC ATT CAA TAT TOG CAT TEA GCT GAT ACT 

GNN'Y KD I Y N I Q YWHLADT 

279 288 297 306 315 324 

TIT GTA TAT TIC TCT CAT GAG CGA GTT AGC ATT CCT CCA GTC AAT TOG ACA AAT 

F V Y F S K E R V S I P P V N W T N 

333 342 351 360 369 378 

GCT TCT CAT AGA AAT GGT GTA AAG TCT TEA GGT ACT TIT TEA GTA GAA GGA AAT 

ACHRNGVKCLGTFLVEGN 

387 396 405 414 423 432 

AAC CAA ATC CAT GAA ATC GAA GCC TTG CTT CAC GGT OCA CCT TTA CTT AAT AAC 

NQMHEMEALLHG PPLLNN 

441 450 459 468 477 486 

ACT GAC GAC CCT ATC AGA TEA TOG AGT COG TAT TAT GCA GAC CAA TEA GTE GCT 

TDDPMRLWSPYYADQLVA 

495 504 513 522 531 540 

ATE GCT AAA CAC TAT GGT ITT GAT GGC TOG TIG TIC AAT ATE GAA TCC GAA TIC 

IAKHY GFDGWL FNIECEF 

549 558 567 576 585 594 

TIT CCT TTT CCT ACA AAT CCA AAA TIC AAA GCT GAA GAG TTC GCA AAG TIT CTA 

FPFPTNPKFKAEELAKFL 

603 612 621 630 639 648 

CAC TAT TIT AAG GAA AAA TTC CAT AAC GAA ATA CCT GGA TCT CAA CIC ATE TOG 

HYFKE KLHNE I PGSQLIW 

657 666 675 684 693 702 

TAC GAC AGC ATC ACA AAT GAA GGA GAA ATC CAC TOG CAG AAC CAG CTC ACA TOG 

YDSM. TNEGE I HWQNQLTW 



Amino acid sequence deduced from the novel endo-β-N-acetylglucosaminidase gene, and
the nucleotide sequence of the DNA encoding this amino acid sequence.
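
The correspondence between the codon rows and the one-letter amino acid rows in this figure follows the standard genetic code. As a minimal sketch, assuming the standard code applies to this gene, the translation can be reproduced as below. The function name `translate` is hypothetical, and the example string is an assumed reading of the first six codons of the figure, chosen to match the residues M P S L Q L printed beneath them.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a coding sequence codon by codon; '*' marks a stop codon.

    Codons containing characters other than A, C, G, T (for example OCR
    damage) are reported as 'X' rather than guessed. Trailing bases that
    do not fill a complete codon are ignored.
    """
    dna = dna.upper().replace(" ", "")
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        protein.append(CODON_TABLE.get(dna[i:i + 3], "X"))
    return "".join(protein)


# Assumed reading of the first six codons shown in the figure.
print(translate("ATG CCT TCA CTT CAA TTG"))
```

Reporting unreadable codons as `X` makes it easy to spot positions where the printed codon row and the printed residue row cannot be reconciled without consulting the original drawing.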
FIG. 6



711 720 729 738 747 756 

AAA AAT GAG TTA TIT TTT AAA AAC ACG GAT GGT ATT TTT TTG AAT TAT TGG TOG 



765 774 783 792 801 810 

AAA AAA GAA TAG OCT GAA ATG GCG OGT AGA GTA GCT GAA GGA ATA GGT AGA TCA 

K K E Y PEMAR RVAEG IGRS 

819 828 837 846 855 864 

GGT TEA GAA GTT TAT. TIT GGT ACA GAT GTA TOG GGA AGG CAT ACT TAT GGT GGC 



873 882 891 900 909 918 

GGT GGT TIC AAA TCA TAT AAG GGT GTA AAA ACT GCC TAC TCT GCA ATG ACA TCT 

GGFKSYK GVKT AYSAMTS 

927 936 945 954 963 972 

TCT GCA TEA TTT GGT ATG GCA TGG ACA TAC GAG CAT TIC GAA AAG TCT GAA TTT 

SALFGMAWTYEHFEKSEF 

981 990 999 1008 1017 1026 

GAA AAG ATG GAT OGT TTG TTT TGG TGT GGT GGT AAA TAC TCT GAC TAT CCT CCC 

EKMD RLFWCGGKYSDYPP 

1035 1044 . 1053 1062 1071 1080 

CCA CCT CCt AAA AAC CCA GAT GAC GAA AAA GAA GTA GAA AGC GAT GAT AGT GAA 



1089 1098 1107 1116 1125 1134 

GAT GAG CIC ATG TAC GGA CAC AAG AAA GGT ATT GCT GAC ACG GTA GAA TCT ATT 

DELMYGHKKG IA DTVES I 

1143 1152 1161 1170 1179 1188 

CCT GTA CCA GGA ACA GAT TGG TTT C7TT ACC AAT TTT GAT AGG GGG TTT GGA AAT 

PVP GTDWF VTNFDRGFGN 

1197 1206 1215 1224 1233 1242 

AGG TTT TAT TAT AGA GGA AAG AGA TEA CTT TCT CAG CCT TOG TCC CAT TEA TOG 



1251 1260 1269 1278 1287 1296 

CAT CAA GCT ATT CTC CCC AAT AAA AGC TAT CGA AAT CCA GAG ATT TAT CCC ACT 

HQ AILPNKSY. RNPEIYPT 

1305 1314 1323 1332 1341 1350 

GAT CAA AAC ATT AAA ATC ACT AGT TCT CTC GAT TGC GAT CAT GGA GCT TTT CTT 

DQNIKITSSLDCDHGAFL 

1359 1368 1377 1386 1395 1404 

GGT GGA ACC TOG CTT ATT ATC AAA GGC CAA OGT TIC AAT CAT AGA GAA TOG CAT 

GGTSLI IKGQRFNHRESH 

1413 1422 1431 1440 1449 1458 

GAT GTT GAA ACT GAA ATT AGT ATA CCT CTG TAT AAG CTT TCA TEA GAT GCT ACT 



Amino acid sequence deduced from the novel endo-β-N-acetylglucosaminidase gene, and
the nucleotide sequence of the DNA encoding this amino acid sequence. (Continued)
FIG. 7

1467 1476 1485 1494 1503 1512

AAA GGA TGC TCA TTG CGT TAT ATT TAT AGA ACT TTG TTG ATG AAA GAT GTA AAG

K G C S L R Y I Y R T L L M K D V K


1521 1530 1539 1548 1557 1566

TTG ACA GTA GCA TGT CAC TTT TCG TTA AAA ACA AAC GAC TCA GTT AAT TTC TTC

L T V A C H F S L K T N D S V N F F


1575 1584 1593 1602 1611 1620

AAG GTA TGG CAG CCA GAT GAA AAT TTC TCT TTT GAA TAT GAT GAT GGA ATG AGA

K V W Q P D E N F S F E Y D D G M R


1629 1638 1647 1656 1665 1674

GCC ACT GTT ACA ACT GAA AAT TCT ACC GAA AGC AGA TGC TTT TTA TTA CGT ACA

A T V T T E N S T E S R C F L L R T


1683 1692 1701 1710 1719 1728

ACA GAA GAA GAT ACA GGA GAA AAT GAT TGG ATA ACA AAA ACT ATT AAT GTG CCT

T E E D T G E N D W I T K T I N V P


1737 1746 1755 1764 1773 1782

CCT GTT CCA GAA GGA ACT CAA TTA TAC ATT ACA AGA CTT GAA GTG AGC GTA GTA



1791 1800 1809 1818 1827 1836 

TTA GAT ACA GCT GGT TTA GTA GGT CTT GTT AAT CAA GTT ATT GCT TGC TTG GGA



1845 1854 1863 1872 1881 1890 

TAT ATT AGC ATC ATA CCA ACT ATA AAT TCT GGA ATA AAA ACA GAT TCA TCA CGC



1899 1908 1917 1926 1935 1944 

ATT ATT CAG GAT CTC TTT TGG AAA GAT CAG AAA TAT ACC AAA ATC GGA AAA GAA



1953 1962 1971 1980 1989 1998

AGT TTA GAC GAC ATA GCT CAA GAA GAA GTT CAT AGA TAT TAT GGA ACA TTG AAC

S L D D I A Q E E V H R Y Y G T L N


2007 2016 2025 2034 2043 2052

TGG GAA AAC ACA GCA AAT GTA GTA AAC GCT TGG GAG GAA ATA GAT TAC TAC AAC

W E N T A N V V N A W E E I D Y Y N


2061 2070 2079 2088 2097 2106

GTT TTT TAC AAA GAA AGT GAC GAC TCT GCA ACT CGC ATC TTT TTA GGA ACA GCA

V F Y K E S D D S A T R I F L G T A


2115 2124 2133 2142 2151 2160

TTC TGT AAT CAA TTT CGT GTA TCT GGT TTA GAT ATT ATT TTA TCT AAG CTA CCA

F C N Q F R V S G L D I I L S K L P


2169 2178 2187 2196 2205 2214

AAG ATA GTT ATT GAA GCT GTT AAC AAA GAA GGA TAC ATC TCT TCA AGT GGT AGC

K I V I E A V N K E G Y I S S S G S


2223 2232

ATA GAT TTG TCA TTA AAC TAG

Amino acid sequence deduced from the novel endo-β-N-acetylglucosaminidase gene, and
the nucleotide sequence of the DNA encoding this amino acid sequence. (Continued)
FIG. 8

Novel endo-β-N-acetylglucosaminidase gene

Not I  EcoT22 I  EcoR I  Sac I  Bcl I  Hind III  Pvu II  Not I

2.5 kbp

Structure of expression vector pGEndo-SC for use in Saccharomyces cerevisiae, which
comprises a novel endo-β-N-acetylglucosaminidase gene.

Expression of endo-β-N-acetylglucosaminidase enzyme in yeast into which
an endo-β-N-acetylglucosaminidase gene has been introduced.

Lanes 1-3: Cellular extract of S. cerevisiae YPH500 (pep4) into which
an endo-β-N-acetylglucosaminidase gene has been introduced.
Lane 4: Purified endo-β-N-acetylglucosaminidase derived from M. hiemalis
Lane 5: Cellular extract of S. cerevisiae YPH500 (pep4)